Reinforcement Learning Part 2 - NISHIO Hirokazu's Scrapbox (Auto-translated from Japanese)